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Abstract 
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1 Introduction 



Dynkin games (Dynkin, 1969) serve as a model of optimal stopping. These games were 
applied in various setups, including wars of attrition (see, e.g., Maynard Smith (1974), 
Ghemawat and Nalebuff (1985) and Hendricks et al. (1988)), pre-emption games (see, e.g., 
Fudenberg and Tirole (1991, section 4.5.3)), duels (see, e.g., Blackwell (1949), Bellman and 
Girshick (1949), Shapley (1951), Karlin (1959), and the survey by Radzik and Raghavan 
(1994)), and pricing of options (Kifer (2000), Hamadene (2006)). 

The existence of a value in randomized strategies in Dynkin games, in its general setting, 
has been settled only recently (see Rosenberg, Solan and Vieille (2001) for discrete time 
games, and Laraki and Solan (2005), for continuous time games). The existence of an e- 
equilibrium in randomized strategies in non-zero-sum games has been proven for two-player 
games in discrete time (Shmaya and Solan, 2004), and for games in continuous time under 
certain conditions (see, e.g., Laraki, Solan and Vieille, 2005). 

In the present paper we prove that every two-player non-zero-sum Dynkin game in 
continuous time admits an e-equilibrium in randomized strategies, for every e > 0. We 
further show how such an equilibrium can be constructed, and we provide a condition 
under which there exists an e-equilibrium in non-randomized strategies. Rather than using 
the Snell envelope, as, e.g., in Hamadene and Zhang (2010), our technique is to use results 
from zero-sum games. 

We note that three-player Dynkin games in continuous time may fail to admit an e- 
equilibrium in randomized strategies, even if the payoff processes are constant (Laraki, 
Solan and Vieille, 2005, Section 5.2). Thus, our result completes the mapping of Dynkin 
games in continuous time that admit an e-equilibrium in randomized stopping times. 

The paper is organized as follows. The model and the main results appear in Section 
[2j In Section [3] we review known results regarding zero-sum games that are then used in 
Section 0] to prove the main theorem. 
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2 Model and Results 



Let (f2, A, P) be a probability space, and let J- = (J 7 t)t>o be a nitration in continuous time 
that satisfies "the usual conditions". That is, J 7 is right continuous, and J-q contains all 
P-null sets: for every B G A with P(B) = and every A C B, one has A G J^q. All stopping 
times in the sequel are w.r.t. the filtration J-. 

Denote Foo := Vt>oFt- Assume without loss of generality that Foo = A. Hence (fl, A, P) 
is a complete probability space. 

Let (Xi, Yi, 2i)t=i,2 be uniformly bounded J'-adapted real-valued processes^ and let 
(Ci)i=l,2 be two bounded real-valued J r 00 -measurable functions. In the sequel we will assume 
that the processes (X; L , Yi, Z; L )i = i^ are right continuous. 

Definition 1 A two-player non-zero-sum Dynkin game over (£l,A,P,J-) with payoffs 
(Xi,Yi, Zi,£i)i = i t 2 is the game with player set N = {1,2}, the set of pure strategies of each 
player is the set of stopping times, and the payoff function of each player i £ {1, 2} is: 

7i(Al, A 2 ) := E [Ai(Ai)l( Al<A2} + y i (A 2 )l{A 2 <A 1 } + ^(Ai)1{A 1= A 2 <oo} + £il{Ai=A 2 =oo}] , 

(1) 

where X± and A 2 are the stopping times chosen by the two players respectively. 

In words, the process Xi represents the payoff to player i if player 1 stops before player 
2, the process Yi represents the payoff to player i if player 2 stops before player 1, the 
process Zi represents the payoff to player i if the two players stop simultaneously, and the 
function £j represents the payoff to player i if no player ever stops. 

The game is zero-sum if X\ + X% = Y\ + Y2 = Z\ + Zi = £1 + £2 = 0. 

In non-cooperative game theory, a randomized strategy is a probability distribution over 

pure strategies, with the interpretation that at the outset of the game the player randomly 

chooses a pure strategy according to the probability distribution given by the randomized 

strategy, and uses it along the game. In the setup of Dynkin games in continuous time, a 

randomized strategy is a randomized stopping time, which is defined as follows. 

Our results hold for the larger class of T> payoff processes defined by Dellacherie and Meyer, 1975, §11-18. 
This class contains in particular integrable processes. 
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Definition 2 A randomized stopping time for player i is a measurable function ipi : [0, 1] x 
0, — > [0, +oo] such that the function <pi(r,-) : f2 — > [0,+oo] is a stopping time for every 
r € [0,1] (see Aumann (1964))- 

Here the interval [0, 1] is endowed with the Borel cr-field. For strategically equivalent 
definitions of randomized stopping times, see Touzi and Vieille (2002). The interpretation 
of Definition [2] is that player i chooses r in [0, 1] according to the uniform distribution and 
then stops at the stopping time <pi(r,-). Throughout the paper, the symbols A, \x and r 
stand for stopping times, and ip and tp stand for randomized stopping times. 

The expected payoff for player i that corresponds to a pair of randomized stopping times 
(<Pi,<P2) is: 

TiOi,^) := / TiOiO", -),P2(s, •)) dr ds, i = l,2. 

J[0,1] 2 

In the sequel we will also consider the expected payoff at a given time t. We therefore 
define for every t > and every pair of randomized stopping times <pi,<p2 t'- 

li(<fl,<P2 | Ft) ■= E[A"j(^i)l{ Vl<V2 }+yi(^2)l{ V2 < v , 1 }+^(^l)l{ </ , 1 = </ , 2 <oo}+61{ v , 1=¥ , 2 =oo} I Ft]- 

(2) 

A pair of randomized stopping times (<p*, <£>|) is an e-equilibrium if no player can profit 
more than e by deviating from <p*. 

Definition 3 Let e > 0. A pair of randomized stopping times (<p*,<P2) i> s an e-equilibrium 

if for every two randomized stopping times <pi,<f2 the following inequalities hold: 

7i(y>i,¥>2) < inivhvt) + e ' ( 3 ) 

and 

72(^1,^2) < 72(^1^2) + e - ( 4 ) 

Because of the linearity of the payoff function, Eqs. ([3]) and (Q hold for every randomized 
stopping time (p% and ip2 respectively as soon as they hold for non-randomized stopping 
times. 

Our goal in this paper is to prove the existence of an e-equilibrium in two-player non- 
zero-sum games, and to construct such an e-equilibrium. 
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Suppose that a player wants to stop at the stopping time A, but he would like to mask 
the exact time at which he stops (for example, so that the other player cannot stop at the 
very same moment as he does). To this end, he can stop at a randomly chosen time in a 
small interval [A, A + 5], and, since the payoff processes are right continuous, he will not lose 
(or gain) much relative to stopping at time A. This leads us to the following class of simple 
randomized stopping times that will be extensively used in the sequel. 

Definition 4 A randomized stopping time <p is simple if there exist a stopping time A 
and a T\-measurable non-negative function 5 > 0, such that for every r € [0, 1] one has 
<p(r, •) = A + rd. The stopping time A is called the basis of <p, and the function 5 is called 
the delay of (p. 

Since cp(r, •) > A and <p(r, •) is J^-measurable, by Dellacherie and Meyer (1975, §IV- 
56), (p(r, •) is a stopping time for every r € [0, 1]. Consequently, ip is indeed a randomized 
stopping time. 

Definition U] does not require that A is finitely on the set {A = oo} we have cp(r, •) = oo 
for every r € [0, 1]. On the set {5 = 0} the randomized stopping time (p that is defined 
in Definition U] stops at time A with probability 1. On the set {5 > 0} the stopping time 
is "non-atomic" yet finite, and in particular for every stopping time \x we have P({<5 > 

0} n {cp = n}) = o. 

We now state our main results. 

Theorem 5 Every two-player non-zero-sum Dynkin game with right- continuous and uni- 
formly bounded payoff processes admits an e-equilibrium in simple randomized stopping 
times, for every e > 0. 

Moreover, the delay of the simple randomized stopping time that constitute the e- 
equilibrium can be arbitrarily small. 

Theorem [5] was proved by Laraki and Solan (2005) for two-player zero-sum games. Our 

proof heavily relies on the results of Laraki and Solan (2005), and we use e-equilibria in 

zero-sum games to construct an e-equilibrium in the non-zero-sum game. 

2 A statement holds on a measurable set A if and only if the set of points in A that do not satisfy the 
statement has probability 0. 
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Under additional conditions on the payoff processes, the e-equilibrium is given in non- 
randomized stopping times. 

Theorem 6 Under the assumptions of Theorem^ if Z\{t) € co{Xi(t), Y±(t)} and Z 2 {t) £ 
co{X 2 {t), Y 2 {t)} for every t>0, then the game admits an e-equilibrium in non-randomized 
stopping times, for every e > 0. 

Hamadene and Zhang (2010) proved the existence of a 0-equilibrium in non-randomized 
stopping times under stronger conditions than those in Theorem [6l using the notion of Snell 
envelope of processes (see, e.g., El-Karoui (1980) for more details). 

The rest of the paper is devoted to the proofs of Theorems [5] and We will assume 
w.l.o.g. that the payoff processes are bounded between and 1. 

3 The Zero- Sum Case 

In the present section we summarize several results on zero-sum games, taken from Laraki 
and Solan (2005), that will be used in the sequel, and prove some new results on zero-sum 
games. 

For every t > denote 

vi(t) := ess-sup ¥ , 1 > t ess-inf A2 > t E[Xi(</?i)l { ^ 1<A2} + Yi(A 2 )l{A 2 <^ 1 } (5) 

+Zi((f 1 )l^ 1= x 2 <oo} + £l 1 { Vl =A 2 =oo} I Ft], 

where the supremum is over all randomized stopping times ipi >t, and the infimum is over 
all (non-randomized) stopping times A2 > t. This is the highest payoff that player 1 can 
guarantee in the zero-sum Dynkin game Ti(t) where the payoffs are those of player 1, player 
1 is the maximizer, player 2 is the minimizer, and the game starts at time t. Similarly, the 
highest payoff that player 2 can guarantee in the zero-sum Dynkin game T 2 (t) where the 
payoffs are those of player 2, player 2 is the maximizer, player 1 is the minimizer, and the 
game starts at time t, is given by: 

v 2 (t) := ess - sup^^ess - inf Al >iE[X 2 (Ai)l {Al<(p2 } +Y 2 (ip 2 )l{ !f2< \ 1 } (6) 

+ Z 2 (Ai)l{ Al=V2<00 } + ^2l{ Al = V2 =oo} I F t ]. 
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The next lemma, which is proved in Laraki and Solan (2005), states that Vi(t) (resp. 
V2{t)) is in fact the value of the zero-sum games T\{t) (resp. r 2 (i)). This lemma is proved 
in Laraki and Solan (2005) when Tt is the trivial c-algebra. Its proof can be adapted to a 
general Tt (see the discussion in Appendix |AJ . 

Lemma 7 

ui(t) = ess - inf^^tess - sup Al > t E[Xi(Ai)l {Al< ^ 2 }+yi(^2)l{v. 2 <Ai} (7) 
+Zi(Ai)l{ Al= ^ 2<00 } + £i1{a 1= i/, 2 =oo} I Ft], 

and 

v 2 (t) = ess-inf^ 1 > t ess-sup A2 > t E[X 2 ('0i)l{v.i<A2} + i 2(A2)l{A 2 <v.i} (8) 

+Z 2 (V>l)l{</, 1= A 2 <oo} + 61{Vi=A 2 =oo} I F], 

where the infimum in ^ is over all randomized stopping times ip2 > t for player 2, the 
supremum in ^7ty ^ s over all (non-randomized) stopping times Ai > t for player 1, the 
infimum in is over all randomized stopping times ifii > t for player 1, and the supremum 
in (0) is over all (non-randomized) stopping times A 2 > t for player 2. 

A stopping time (pi (resp. ipi) that achieves the supremum in ([5]) (resp. infimum in ([8])) 
up to e is called an e-optimal stopping time for player 1 in Ti(t) (resp. r 2 (i)). Similarly, a 
stopping time </? 2 (resp. ^ 2 ) that achieves the supremum in © (resp. infimum in ([7])) up 
to £ is called an e-optimal stopping time for player 2 in T 2 (i) (resp. Ti(i)). 

The proof of Laraki and Solan (2005, Proposition 7) can be adapted to show that the 
value process is right continuous (see Appendix |A|) . 

Lemma 8 The process (vj(i))t>o is right continuous, for each i G {1,2}. 

The following two lemmas provide crude bounds on the value process. 
Lemma 9 For every t > and each i = 1, 2 one has 

mm{Xi(t),Yi(t)} < Vi(t) < maxpQ(t) on CI. 
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Proof. We start by proving the left-hand side inequality for i = 2. Let e > be 
arbitrary, and let 5 > be sufficiently small such that 

P( sup \X 2 (t) - X 2 (t + p )\ >£ )<e, (9) 
pe[o,S] 

P( sup \Y 2 (t) - Y 2 (t + p )\ >£ )<e. (10) 
P e[o,<5] 

Such 8 exists because the processes X 2 and Y 2 are right continuous. 

Let (p 2 be the simple randomized stopping time <p 2 (r, •) = t + rd, and let Ai > t be any 
non-randomized stopping time for player 1 . The definition of ip 2 implies that the probability 
that Ai = ip2 is ; P(^i = ^2) = 0. Moreover, (p 2 < 00. Therefore 

72(Ai,<£> 2 I -T 7 *) = E[X 2 (Ai)l {Al<¥ , 2} +y 2 (^2)l{ ¥ , 2 <A 1 } I 

By © and (HUD this implies that 

P( 72 (Ai,95 2 I F t ) < mm{X 2 (t),Y 2 (t)} - e) < 2s. 

Because Ai is arbitrary, Eq. © implies that 

P{v 2 {t) < mm{X 2 {t),Y 2 {t)} - e) < 2e. 

The left-hand side inequality for i = 2 follows because e is arbitrary. 

The proof of the right-hand side-inequality for i = 2 follows the same arguments, by 
using the simple randomized stopping time (pi(r, •) = t + rS. Indeed, for every stopping 
time X 2 for player 2 we then have 

72(^1, A 2 I F) = E[X 2 (^i)l { ^ 1<A2} + y 2 (A 2 )l { ^ 1>A2} I F t ]. 

The same argument as above, using ([8]), delivers the desired inequality. The proof for i = 1 
is analogous. ■ 

Lemma 10 For every t > 0, one has 

vi(t) < m&x{Y 1 (t),Z 1 (t)} on 0, 
v 2 (t) < max{X 2 (t),Z 2 (t)} on O. 
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Proof. We prove the Lemma for i = 1. Let ip 2 = t: player 2 stops at time t. By ([7]), 

vi(t) < ess - sup Al > t 7i(Ai, ip 2 \?t)- 

Because for every (non-randomized) stopping time Ai for player 1, 71 (Ai,^ I -^t) is either 
Yx(t) (if Ai > t) or Z x {t) (if Ai = t), the result follows. ■ 

Following Lepeltier and Maingueneau (1984), for every 77 > let \x\ and //^ be the 
stopping times defined as follows: 

^:=inf{s>0: X^s) > Vl (s) - rj} , (11) 

and 

$ := inf{s > : Y 2 {s) > v 2 (s) - rj}. (12) 

As the following example shows, the stopping times ^ and ii\ may be infinite. Consider 
the following Dynkin game, where the payoffs are constants: X\ = 0, Y\ = Z\ = 2 and 
£1 = 1. Then v\{t) = 1 for every i, and [i\ = 00, provided r\ € (0, 1). 

Observe that fj^ — whenever r] > rj '. Moreover, because the processes X±, Y 2 , v± 
and v 2 are right continuous, we have 

X 1 (^ 1 )>v 1 ^ 1 )- V , (13) 

and 

Y 2 (j4)>V2(j4)-ri. (14) 
For every t < /i^, by the definition of {i\ and Lemma [9J we have 

X 1 (t) < Vl (t)-r]< Vl (t) < mzxiXxit)^)}, 

and therefore 

Y 1 (t)>X 1 (t), Vt<M?. (15) 
The analogous inequality for player 2 holds as well. 

Lemma 11 Let e,r] > 0, let r be a stopping time, and let A € T T satisfy P(A \ {//^ = 
00}) < e. Then 

B[ Vi (t)1 a ] < E[61^ nK=oo} ] + 3e + 6e/ V . (16) 
9 



Proof. Let ip2 = oo: player 2 never stops. By ((7J), 

vi(t) < ess - sup Al > r 7i(Ai, ip 2 I •?>)• (17) 

Let Ai > r be a stopping time for player 1 that achieves the supremum in (|17p up to e. Let 
A^ be the following stopping time: 

• On A fl {Ai < oo}, X[ is an ry/2-optimal stopping time for player 1 in Ti(Ai). 

• Onin{Ai = oo}, X[ = oo. 

It follows that 

E[«i(r)l A ] < E[ 7l (A 1 ,V 2 \F T )l A ]+e 

= E[Xi(Ai)l J 4 n { Al<00 } + 6lAn{Ai=oo}] + e 

< E[(«i(Ai) - ??)lAn{Ai<^=oo} + -^l(^l) 1 yln{Ai<oo}n{Ai7<oo} + CllAn{Ai=oo}] + ^ 

< E[(ui(Ai) - ^Ia^a^oo} + £ilAn{A 1= oo}] + 3e 

< Eb^A;,^ I F t )1a] ~ ?E[l An{Al<oo} ] + 3e 



< E[ 7l (Ai,V 2 I ^t)1a] - 7jE[l An{Al<oo} ] +4e 



2 
2 

where the second inequality holds by the definition of /i^, the third inequality holds since 
P(A \ {fj^ = oo}) < e and since payoffs are bounded by 1, and the last inequality holds 
because Ai is e-optimal. 

This sequence of inequalities implies that 

P(^n{Ai < oo}) < 6e/r?, 

and therefore 

E[ Vl (r)l A ] < E[61 A nM=oc}] + 3e + 6e/ V , 

as desired. ■ 

By Lepeltier and Maingueneau (1984), for each i = 1, 2 the process Vi is a submartingale 
up to time 

Lemma 12 For every r\ > the process (vi(t)) t l is a submartingale: for every pair of 
finite stopping times A < A' < [i[ one has i>i(A) < E[t>i(A') | J-\] on £1. 

10 



Lemma [12] implies that before time sup r;>0 [i\ player 1 is better off waiting and not 
stopping. An analogue statement holds for player 2. 
Lemmas 1 1 1 1 and 1 1 2 1 deliver the following result. 

Lemma 13 Let rj > 0. For every stopping time X% that satisfies X\ < one has 

vi(Xi) < E[v 1 (fi v 1 )l {ll v <oo} + ^l w=oo} | JaJ. 

Proof. Let e > be arbitrary. By Lemma [T2"l for every t > one has 

vi(\i)<E[vi(Tam{fil,t})]. 

Let to be sufficiently large such that P(to < < oo) < e. By Lemma [TT] with t = to and 
A = {t < Ml}; 

Ebi(to)l {to <^ } ] < E[^l {t0 <^ =oo} ] + 3e + 66/r?. 

Therefore, 

«!(A X ) < E[vi(mm{/xJ,to})] 

= E[vi(/i;)l {/i j <to} + vi(t )l {to < M j } ] 

< E[wi(^)l { ^ <oo} + ai{^=oo}] + 5e + 6e/r?. 

The result follows since e is arbitrary. ■ 

The proof of Laraki and Solan (2005, Section 3.3) delivers the following result, which 
says that each player i has a simple randomized e-optimal stopping time that is based on 
[1%, provided r\ is sufficiently small. 

Lemma 14 For every i = 1,2, every e,n > 0, and every positive J-^n -measurable function 
Si, there exists a simple randomized stopping time tp 7 ? with basis [x\ and delay at most Si 
that satisfies 

Hfoi, h-i \^v)> v i{$) - e-r) onQ, (18) 
for every stopping time X^-i > fj%. 
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By Eq. §F5§, before time one has X 1 <Y 1 . When Xi(t) < Zi(t) < Yi(t) for every t, a 
non-randomized e-optimal stopping time exists (Lepeltier and Maingueneau, 1984). Laraki 
and Solan (2002, Section 4.1) use this observation to conclude the following. 

Lemma 15 If Zi{t) € co{Xi{i), Y{(t)} for every t > and each i = 1,2, then the simple 
randomized stopping time <pl in Lemma \TJ^ can be taken to be non-randomized ( that is, the 
delay of both players is 0). 

4 The Non- Zero- Sum Case 

In the present section we prove Theorems [5] and [6) Fix e > once and for all. 

Let 5q (resp. S\, 62) be a positive J>-measurable function that satisfies the following 
inequalities for each i € {1,2} and for the stopping time r = (resp. r = p![, r = p,^). 
Such 5o (resp. 81, S2) exists because the processes (Xi,Yi,Vi)i=i t 2 are right continuous. 

P( sup \Xi{r) - Xi(r + p)\ >e) < e, (19) 

pe[o,<5 ] 

P( sup \Y i {T)-Y l {r + p)\> e) < e, (20) 
pe[o,8 ] 

P( sup \ Vi {r) -Vi(T + p)\ > e) < e. (21) 

pe[o,s a ] 

We divide the set Q into six J^-measurable subsets. For each of these subsets we then 
define a pair of randomized stopping times {(p\,(p^), and we prove that, when restricted to 
each set, this pair is a fee-equilibrium, for some < k < 13. It will then follow that {ip\, tp^), 
when viewed as a randomized stopping time on Vt, is a 78e-equilibrium. The partition is 
similar to that in Laraki, Solan and Vieille (2005), and only the treatment on the last subset 
is different. 

Denote by ipi(t,e) an e-optimal stopping time of player i in the game T3_j(t); thus, the 
randomized stopping time ipi(t, e) is a punishment strategy against player 3 — i, as it ensures 
that his payoff will not exceed v^-i{t) + e. 

Part 1: The set A 1 := {X x (0) > vi(0)} n {^ 2 (0) > Z 2 (0)}. 

We prove that when restricted to the set Ai, the pair (ip\, ip^) that is defined as follows 
is a 4e-equilibrium: 
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• ip\ = 0: player 1 stops at time 0. 

• ip 2 = ifj 2 (5o,e): If player 1 does not stop before time So, player 2 punishes him in the 
game Ti((5o) that starts at time Sq. 

If no player deviates, the game is stopped by player 1, and the payoff is 

j(<pl,<p* 2 I Jo) = (Xi(0),*2(0)) on Al 

We argue that player 2 cannot profit by deviating. Indeed, let A2 be any non-randomized 
stopping time of player 2. Then on A\ 

7a(vl, A 2 I Jo) = Z 2 (0)l Ain{A2=0} + ^2(0)lA in {A 2 >o} < ^2(0) = 72O1, I Jo), 
and the claim follows. 

We now argue that on A\ player 1 cannot profit more than 4e by deviating from (p*. 
Let Ai be any non-randomized stopping time of player 1. Then by the definition of ip 2 , on 
Ai 

7i(Ai,(^2 I Jo) < E[Xi(Ai)l {Al<(5o} + (vi(S Q ) +e)l {<5o < Al} | J ]. 
Bv (fT3 |) . pi ji . and since Xi(0) > t>i(0) on A 1 , it follows that on A 1 

P( 7 i(Ai,^ I J ) > E[Xi(0)l {Al<&} + (JTi(O) + e)l {So < Xl} | J ] + e) < 2e. 

Since 7i(y>i,y>2 | Jo) = -Xi(O) on A\ it follows that 

P(A 1 n{ 7 i(Ai,^ I Jo) > 71(^,^2 l^o) + 2e}) <2e, (22) 

and the desired results follows. 

Part 2: The set A 2 := {Z 2 (0) > X 2 (0)} n {2^(0) > Yi(0)}. 

We prove that when restricted to the set ^2, the pair (<^, </?|) that is defined as follows 
is a 0-equilibrium: 

• ip\ = 0: player 1 stops at time 0. 

• (^2 = 0: player 2 stops at time 0. 
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If no player deviates, both players stop at time 0, and the payoff is 

l(<p*,(pli | Jb) = (Zi(0),Z 2 (0)) on A 2 . 

To see that player 1 cannot profit by deviating, fix an arbitrary non-randomized stopping 
time Ai for player 1. On A 2 one has 

71 (Ai,^ I To) = Zi(0)l {Al=o} + Yi(0)l {Al>0} < Zx(0) = 71 {vhti \ T ), (23) 
as desired. A symmetric argument shows that player 2 cannot profit by deviating either. 

Part 3: The set A 3 := {Yi(0) > Zi(0)} D {Y 2 (0) > v 2 (0)}. 

The case of the set A3 is analogous to Part 1: when restricted to A3, the pair of 
randomized stopping times in which player 2 stops at time 0, and player 1 plays an e- 
optimal stopping time ^i(<5o,e) in the game r 2 (<5o), is a ^-equilibrium. 

Part 4: The set A 4 := (Xi(0) > «i(0)} D {^ 2 (0) > ^(0)}. 

We prove that when restricted to the set A4, the pair (ip\, ip^) that is defined as follows 
is a 6e-equilibrium: 

• tpi(r, •) = r5o: player 1 stops at a random time between time and time 5q. 

• 1P2 = ip2(fio,£)'- If player 1 does not stop before time So, player 2 punishes him in the 
game Ti((5o) that starts at time So- 

If no player deviates, the game is stopped by player 1 before time Sq, and by (|19p the 
payoff is within 2e of (Xl(0), X 2 (0)): 

P(A A n {hiirt,^) ~ Xi(0)\ > e}) < e. (24) 

The same argument^ as in Part 1 shows that 

P(A 4 n{ 7 i(Ai,^ I T ) > 7i(v?*, ¥>2 I T ) + 3e}) < 3e. (25) 

3 The additional e arises because in Part 1 we had 71 (v*, ^2) — -Xi(O), whereas in Part 4 we have 
P(A 4 n {71 (<fit, <p* 2 ) < Xi(0) - e}) < e. 
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It follows that player 1 cannot profit more than 6e by deviating from ip\. 

We now argue that player 2 cannot profit more than 5e by deviating from ip\. Fix a 
non-randomized stopping time A2 for player 2. On A4 we have tp* < <5o, and ~P(A4 fl = 
A 2 }) = 0, and therefore 

72(^1, A 2 ) = E[X 2 (^)1 { ^ <A2} + y 2 (A 2 )l{A 2 <^} ! Fo] on A A . 
By (USD and (|20|> . 

P(72K,A 2 ) > E[(X 2 (0) + e)l M<M + (Y 2 (0) + £ )1 {A2< ^ } I Jb]) < 2e. 
Because X 2 (0) > 1^(0) on ^4 we have 

P(72(Vi,A 2 ) >X 2 (0)+e) <2e. 
Together with (f!M|) we deduce that 

P(72(^,A 2 ) > 72 (^,^) + 2 £ ) <3e, 

and the claim follows. 

Part 5: The set A 5 := {Xi(0) > «i(0)} \ (Ai U A 2 U A 3 U A A ). 

We claim that ~P(A 5 ) = 0. Since Xi(0) > v 1 (0) on A 5 , and since ^4 5 n = 0, it follows 
that X 2 (0) < Z 2 (0) on A 5 . Since A 5 (1 A 2 = 0, it follows that Zi(0) < Yi(0) on A 5 . Since 
A 5 n A 3 = 0, it follows that Y 2 (0) < v 2 (0) on A 5 . Since i 5 fl 4 4 = 0, it follows that 
^2(0) — ^2(0) on A5. Lemma [9] then implies that 

Y 2 (0) < v 2 (0) < max{X 2 (0), Y 2 (0)} = F 2 (0) on A 5 , 

which in turn implies that P(A^) = 0, as claimed. 

The union A\U A2U A3U A4U A$ includes the set {Xi(0) > v\(0)}. Thus, when restricted 
to this set, the game has a 7e-equilibrium. By symmetric arguments, a 6e-equilibrium exists 
on the set {Y 2 (0) > v 2 (0)}. We now construct a 13e-equilibrium on the remaining set, 
{X 1 (0) <v 1 (0)}n{Y 2 (0) <v 2 (0)}. 



15 



Part 6: The set A e := {Xi(0) < «i(0)} n {Y 2 (0) < u 2 (0)}. 

Fix r\ > 0, and for each i 6 {1,2} let <^ be a simple randomized stopping time with 
basis fj% and delay at most 8i that satisfies Eq. (fT8|) for every stopping time A3_j > (see 
Lemma [H]). Let ipi(fJ- 2 + ^2 5 e) (resp. tp 2 (fJ[ + <5i> e )) be a simple randomized e-optimal 
stopping time for player 1 in the game T 2 (fJ?2 + £2) (resp. in the game Ti{(jl[ + <5i)); that is, 
a stopping time that achieves the infimum in ([8]) up to e, for t = fi 2 + ^2 (resp. the infimum 
in (J7J) up to e, iov t = rf{ + 5\) . 

Set fj, v = min{/^, /i^}- We further divide Aq into six J 7 ^ -measurable subsets; the 
definition of (</?*, (p%) is different in each subset, and is given in the second and third columns 
of Table 1. Under (ip\,(p 2 ) the game will be stopped at time [fl or during a short interval 
after time fjP, if < 00, and will not be stopped if fj/ 1 = 00. 



Subset 




* 

V?2 


71(^1, ¥>|) 


-461 


= A 6 n {i4 < 






>Xi(/i")-2e 


-462 


= A 6 n {i4 < /x?} 




V 
f 2 


>yi(M")-2 £ 


A>3 


= A & n = $ = 00} 


00 


OO 


= 6 


A M 


= A 6 n = t4 < 00} n {z^D < YM)} 








A 65 


= A 6 n {i4 = ^ 2 <oo}n {z 2 (vt>) < x 2 (fl)} 






= ^l(M") 


A m 


= A 6 n { M ? = 4 < 00} n m(^) < ZiC^)} 
n{x 2 (^) < z 2 (/i")} 









Table 1: The randomized stopping times (</>*, v?^) on ^6, with the payoff to player 1. 



We argue that when restricted to Aq, the pair (<Pi,ipz) is a 13e-equilibrium. Note that 
the roles of the two players in the definition of ((p*,<p 2 ) are symmetric: ip\ = <p 2 on an d 
Aqq, and the role of player 1 (resp. player 2) in A%\ and ^4 is similar to the role of player 
2 (resp. player 1) in Aq 2 and A§§. To prove that (tp*, (p 2 ) is a 13e-equilibrium it is therefore 
sufficient to prove that the probability that player 1 can profit more than 3e by deviating 
from ip* is at most lOe. 

We start by bounding the payoff 71 (<p*, ip 2 \ F^n) (the bound that we derive appears on 
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the right-most column in Table 1), and by showing that 

7l(<P*,¥>2 I -?>") ^ Mf^) ~ 3e - V on A e \ A e3 . (26) 
We prove this in turn on each of the sets Aqi, . . . , Aqq\ 

• On Aqi we have fjP = fj%, and the game is stopped by player 1 between times and 
pP + 8% , so that by (fT9|) we have 

p(Au n{>n(<PiM I J» < ^i(^) -4) < ^ (27) 

By (fT3|) we have Xi(pP) > v\{pP) — i], and therefore ([26]) holds on Aqi. 

• On we have p v = and the game is stopped by player 2 between times p v and 
pP + 82, so that by ([2"Uj) we have 

P(A 62 n {71(^1, </>2 I < YiUP) ~ e » < £ - (28) 

By (fT5|) we have Xx{p T} ) < Yx(p n ) on yl62> so that by Lemma H we have > 
vx(p v ). It follows that (|2SD holds on Aq 2 . 

• On n o player ever stops, and therefore 71(93*, </?2 | ^>i) = £i- 

• On player 2 stops at time p v , and therefore 71(9?*, 99*. I -^>' 7 ) = ^1 (A* 17 ) • By Lemma 
[Till on Aq4 we have 

Vi (jjP) < maxiYxin^ZxfjjP)} = Yx(jjP), 
and therefore ([25]) holds on ^64. 

• On Aqs player 1 stops at time pJ 1 , and therefore 71(931,92*. | = -^lO-* 7 ')- By (fT3|) 
we have .X~i(// ? ) > vx(p v ) — 77, and therefore ([26]) holds on ^65- 

• On both players stop at time /i* 7 , and therefore 71(99^,992 | -?>')) = Zi{pP). By 
Lemma [TOl on this set we have 

vxOjP) < max{yi(^),Zi(^)} = Zx(jjP), 

and therefore ([25]) holds on ^66- 
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Fix a stopping time Ai for player 1. To complete the proof of Theorem [5] we prove that 
P(A 6 n{ 7 i(A 1 ,^)> 7 i(^,^) + 3e}) < lOe. 

• On the set Aq n {Ai < fi v } we have by the definition of /x^, since fi v < //[, by Lemma 
US and by (26]), 

71 (Ai,^ | T Xl ) = Xi(A0 

< v 1 (X 1 )-r ] (29) 

< E[wi(/i r? )l A6n { Al<M „ <00 } + Cl 1 J 4 6 n{Ai<^=oo} I -^Ai] - 

< JliPi,^ I -^aJ + 3e, 

where the last inequality holds by (f2U|) and because the payoff of player 1 on A^ is 

6- 

• On the set Aqi n {f/ 1 < Ai} we have by the definition of tp\ 

7i(Ai,<^2 I = E[Xi(Ai)l {Al <^ +5l} + + Sx) + e)l { ^ +Sl<Xl} \ T^]. 

By (USD, (EU and (JED we have 

P( 7 i(Ai,^ | J» > + 2e) < 2e. 

By (|27|) we deduce that 

P( 7 i(Ai,^ | J» > 7i(Vi,¥>a I + 3e) < 3e. (30) 

• On the set Aq2 H {/i* 7 < Ai} we have by the definition of ip\ 

7i(Ai, I = E^i^i) 1 ^"^!^} + y i( ( ^2) 1 {^<A 1 } I J>"1- 
By (JED, J2QII, since ^ < ^ on A 62 , and by ([IS]), 

P(7i(Ai,^ | Tp) > EKYxW) + e)l { ^<x 1<v *} + (W) + e)l {v -< Xl} \ F^]) < 2e. 
By d2HD we deduce that 

P(^ 62 n{7i(Ai ! ^ | J» >-n(<Pi,<P2 l^) + 2e}) <3e. (31) 
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• On the set ^3 n {/^ < Ai} we have = \% = 00, so that 

71(^1^2 I -fyO =€1 = 71(^1,^2 I on ^63 n < Ai}. (32) 

• On the set n {/jP < Ai} we have 

7l(Ai,^|^) = B[Z 1 (^)l {Xl= ^ } + Y 1 (^)l { ^ <Xl} ] (33) 

< y 1 (^) = 7l (^,^|^). 

• On the set n {fjP < Ai} we have by the definition of <j>\* 

7i(Ai,^2 I J>i) = E[Xi(Ai)l {Ml ,< Al<At „ +5l} + (uiO^ + <5i) + e)l {At „ +(5l < Al} | T^}. 
By (USD, (EH) and (USD we have 

P(A 6S n {fjP < Ai} n {71 (Ai,^ I Jy,) > + 2e}) < 2e. 

Because 71 (y^,^ | •TyO = -Xi (/■*'') on ^5, we obtain 

P(A>5 n {//" < Ai} n {71 (Ai,^ | J» > 71 (</>*, ^2 I + 2e » < 2e. (34) 

• On the set A&§ n < Ai} we have 

7i(Ai,¥>a l-TyO = Z l(^) 1 {Ai=M"} + Y l(^) 1 {/ 1 "<A 1 } 

< Z 1 (^)= 71 (^,^|^). (35) 

From (|2ID, ([31]), d33J, ([33]), flM} and ([35]) we deduce that on A 6 n {At" < Ai} 

P( 7 i(A l5 ^ I > 7l (<pl,<pl I M ") + 3 £ ) < lOe. (36) 

Because (f29j) and ([36]) hold for every stopping time Ai for player 1, it follows that (tp*,^) 
is a 13e-equilibrium on Aq, as desired. 

Proof of Theorem [6] To prove that if Z x (t) G co{Xi(i), Yi(i)} and Z 2 (i) G 
co{X2{t), Y2(t)} for every i > 0, then there is a pair of non-randomized stopping times 
that form an e-equilibrium, we are going to check where randomized stopping times were 
used in the proof of Theorem^ and we will see how in each case one can use non-randomized 
stopping times instead of randomized stopping times. 
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1. In Part 1 (and in the analogue part 3) we used a punishment strategy ipi(5o,e) that 
in general is a non-randomized stopping time. However, by Lemma [T5| when Z 2 (t) € 
co{ X 2 (t), Y 2 (t)} for every i > 0, this randomized stopping time can be taken to be 
non-randomized. 

2. In Part 4 we used, in addition to the punishment strategy ip 2 (5o, £ )> a simple random- 
ized stopping time for player 1. The set that we were concerned with in part 4 was 
the set A A := {Jfi(0) > ui(0)} n {X 2 (0) > Y 2 (0)}. Because Z 2 (0) € co{X 2 (0), F 2 (0)} 

X 2 (0) > Z 2 (0) > F 2 (0). 

But then the following pair of non-randomized stoping times is a 3e-equilibrium when 
restricted to A4: 

• ip\ := 0: player 1 stops at time 0. 

• (p 2 '■= tp2(So, s): if player 1 does not stop before time 5q, player 2 punishes him 
(with a non-randomized stopping time; see first item) in the game Ti((5o). 

3. In Part 6 randomization was used both for punishment (on Aqi, Aq 2 , Aq^ and Aq^) 
and for stopping (on A%\ and A§ 2 ). As mentioned above, under the assumptions of 
Theorem O for punishment one can use non-randomized stopping times. We now 
argue that one can modify the definition of (y^,^) on A§\ and Aq 2 so as to obtain 
a non-randomized equilibrium. Because of the symmetry between Aqi and Aq 2 , we 
show how to modify the construction only on A%\ . 

On Aqi we have ix[ < /i 2 , so that by (fT5|) we have 12(^1) < -^2(^2) • Because Z 2 {^[) £ 
co{X 2 (/^),Y 2 (//?)} it follows that K 2 (Mi) < z iil4) ^ x 2(/4')- But then the following 
pair of non-randomized stoping times is a 3e-equilibrium on Aqi: 

• (fi := fj,^: player 1 stops at time /z^. 

• (p 2 := ip 2 (ni + if player 1 does not stop before time \x[ + Si, player 2 
punishes him (with a non-randomized stopping time; see first item) in the game 

rM + 81). 
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A The result of Laraki and Solan (2005) 

As mentioned before, Laraki and Solan (2005) proved Theorem [5] for two-player zero-sum 
Dynkin games. We need the stronger version that is stated in Lemma where the payoff is 
conditioned on the a- algebra Tt- It turns out that the arguments used by Laraki and Solan 
(2005) prove this case as well, when one uses the following Lemma instead of Lemma 4 in 
Laraki and Solan (2005). 

Lemma 16 Let X be a right- continuous process. For every stopping time A and every 
positive T\-measurable function e there is a positive T\-measurable and bounded function 5 
such that: 

\X(X) - ELY(p) | < e, (37) 
for every stopping time p that satisfies A < p < A + 5. 

Proof. Because the process X is right continuous, the function w h-> E[X(A + w) \ J-\] 
is right-continuous at w = on Q, and it is equal to A (A) at w = 0. By defining 

5' = ^sup{w > 0: \X(X) -E[X(X + w) \ T x ]\ < e], 

we obtain a positive J^-measurable function such that (|37|) is satisfied for every stopping 
time p, A < p < A + 5' . The proof of the Lemma is complete by setting 5 = min{<5', 1}. ■ 

This Lemma can also be used to adapt the proof of Proposition 7 in Laraki and Solan 
(2005) in order to prove Lemma [51 which states that the value process is right continuous. 

One can use Lemma [TBI to improve some of the bounds given in Section HJ We chose not 
to use this Lemma in the paper, so as to unify the arguments given for the various bounds. 
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